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Abstract 



We propose a framework for studying optimal market making policies in a limit 
order book (LOB). The bid-ask spread of the LOB is modelled by a Markov chain with 
finite values, multiple of the tick size, and subordinated by the Poisson process of the 
tick-time clock. We consider a small agent who continuously submits limit buy/sell 
orders at best bid/ask quotes, and may also set limit orders at best bid (resp. ask) plus 
(resp. minus) a tick for getting the execution order priority, which is a crucial issue in 
high frequency trading. By trading with limit orders, the agent faces an execution risk 
since her orders are executed only when they meet counterpart market orders, which 
are modelled by Cox processes with intensities depending on the spread and on her 
limit prices. By holding non-zero positions on the risky asset, the agent is also subject 
to the inventory risk related to price volatility. Then the agent can also choose to trade 
with market orders, and therefore get immediate execution, but at a least favorable 
price because she has to cross the bid-ask spread. 

The objective of the market maker is to maximize her expected utility from re- 
venue over a short term horizon by a tradeoff between limit and market orders, while 
controlling her inventory position. This is formulated as a mixed regime switching re- 
gular/impulse control problem that we characterize in terms of quasi- variational system 
by dynamic programming methods. In the case of a mean-variance criterion with 
martingale reference price or when the asset price follows a Levy process and with 
exponential utility criterion, the dynamic programming system can be reduced to a 
system of simple equations involving only the inventory and spread variables. 

Calibration procedures are derived for estimating the transition matrix and inten- 
sity parameters for the spread and for Cox processes modelling the execution of limit 
orders. Several computational tests are performed both on simulated and real data, 
and illustrate the impact and profit when considering execution priority in limit orders 
and market orders. 

Keywords: Market making, limit order book, inventory risk, point process, stochastic 
control. 



1 Introduction 



Most of modern equity exchanges are organized as order driven markets. In such type of 
markets, the price formation exclusively results from operating a limit order book (LOB), an 
order crossing mechanism where limit orders are accumulated while waiting to be matched 
with incoming market orders. Any market participant is able to interact with the LOB by 
posting either market orders or limit order^] 

In this context, market making is a class of strategies that consists in simultaneously 
posting limit orders to buy and sell during the continuous trading session. By doing so, 
market makers provide counterpart to any incoming market orders: suppose that an investor 
A wants to sell one share of a given security at time t and that an investor B wants to buy 
one share of this security at time t' > t; if both use market orders, the economic role of the 
market maker C is to buy the stock as the counterpart of A at time t, and carry until date t' 
when she will sell the stock as a counterpart of B. The revenue that C obtains for providing 
this service to final investors is the difference between the two quoted prices at ask (limit 
order to sell) and bid (limit order to buy), also called the market maker's spread. This role 
was traditionally fulfilled by specialist firms, but, due to widespread adoption of electronic 
trading systems, any market participant is now able to compete for providing liquidity. 
Moreover, as pointed out by empirical studies (e.g. [H],[S]) and in a recent review [7] from 
AMF, the French regulator, this renewed competition among liquidity providers causes 
reduced effective market spreads, and therefore reduced indirect costs for final investors. 

Empirical studies (e.g. [11]) also described stylized features of market making strategies. 
First, market making is typically not directional, in the sense that it does not profit from 
security price going up or down. Second, market makers keep almost no overnight position, 
and are unwilling to hold any risky asset at the end of the trading day. Finally, they manage 
to maintain their inventory, i.e. their position on the risky asset close to zero during the 
trading day, and often equilibrate their position on several distinct marketplaces, thanks to 
the use of high-frequency order sending algorithms. Estimations of total annual profit for 
this class of strategy over all U.S. equity market were around 10 G$ in 2009 [7J. 

Popular models of market making strategies were set up using a risk-reward approach. 
Three distinct sources of risk are usually identified: the inventory risk, the adverse selection 
risk and the execution risk. The inventory risk pQ is comparable to the market risk, i.e. 
the risk of holding a long or short position on a risky asset. Moreover, due to the uncertain 
execution of limit orders, market makers only have partial control on their inventory, and 
therefore the inventory has a stochastic behavior. The adverse selection risk, popular in 
economic and econometric litterature, is the risk that market price unfavourably deviates, 
from the market maker point of view, after their quote was taken. This type of risk 

1 A market order of size m is an order to buy (sell) m units of the asset being traded at the lowest 
(highest) available price in the market, its execution is immediate; a limit order of size t at price q is an 
order to buy (sell) I units of the asset being traded at the specified price q, its execution is uncertain and 
achieved only when it meets a counterpart market order. Given a security, the best bid (resp. ask) price 
is the highest (resp. lowest) price among limit orders to buy (resp. to sell) that are active in the LOB. 
The spread is the difference, expressed in numeraire per share, of the best ask price and the best bid price, 
positive during the continuous trading session (see [B]). 
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appears naturally in models where the market orders flow contains information about the 
fundamental asset value (e.g. [5]). Finally, the execution risk is the risk that limit orders 
may not be executed, or be partially executed [10J. Indeed, given an incoming market 
order, the matching algorithm of LOB determines which limit orders are to be executed 
according to a price/time prioritjj^J and this structure fundamentally impacts the dynamics 
of executions. 

Some of these risks were studied in previous works. The seminal work pQ provided 
a framework to manage inventory risk in a stylized LOB. The market maker objective 
is to maximize the expected utility of her terminal profit, in the context of limit orders 
executions occurring at jump times of Poisson processes. This model shows its efficiency to 
reduce inventory risk, measured via the variance of terminal wealth, against the symmetric 
strategy. Several extensions and refinement of this setup can be found in recent litterature: 
|8j provides simplified solution to the backward optimization, an in-depth discussion of its 
characteristics and an application to the liquidation problem. In [2], the authors develop a 
closely related model to solve a liquidation problem, and study continuous limit case. The 
paper [3j provides a way to include more precise empirical features to this framework by 
embedding a hidden Markov model for high frequency dynamics of LOB. Some aspects of 
the execution risk were also studied previously, mainly by considering the trade-off between 
passive and aggressive execution strategies. In |10j . the authors solve the Merton's portfolio 
optimization problem in the case where the investor can choose between market orders or 
limit orders; in [13], the possibility to use market orders in addition to limit orders is also 
taken into account, in the context of market making in the foreign exchange market. Yet 
the relation between execution risk and the microstructure of the LOB, and especially the 
price/time priority is, so far, poorly investigated. 

In this paper we develop a new model to address these three sources of risk. The stock 
mid-price is driven by a general Markov process, and we model the market spread as a 
discrete Markov chain that jumps according to a stochastic clock. Therefore, the spread 
takes discrete values in the price grid, multiple of the tick size. We allow the market maker 
to trade both via limit orders, which execution is uncertain, and via market orders, which 
execution is immediate but costly. The market maker can post limit orders at best quote 
or improve this quote by one tick. In this last case, she hopes to capture market order 
flow of agents who are not yet ready to trade at the best bid/ask quote. Therefore, she 
faces a trade off between waiting to be executed at the current best price, or improve 
this best price, and then be more rapidly executed but at a less favorable price. We 
model the limit orders strategy as continuous controls, due to the fact that these orders 
can be updated at high frequency at no cost. On the contrary, we model the market 
orders strategy as impulse controls that can only occur at discrete dates. We also include 
fixed, per share or proportional fees or rebates coming with each execution. Execution 
processes, counting the number of executed limit orders, are modelled as Cox processes 
with intensity depending both on the market maker's controls and on the market spread. 
In this context, we optimize the expected utility from profit over a finite time horizon, 

2 A different type of LOB operates under pro-rata priority, e.g. for some futures on interest rates. In this 
paper, we do not consider this case and focus on the main mechanism used in equity market. 



3 



by choosing optimally between limit and market orders, while controlling the inventory 
position. We study in detail classical frameworks including mean-variance criterion and 
exponential utility criterion. 

The outline of this paper is as follows. In section 2, we detail the model, and comment 
its features. We also provide direct calibration methods for all quantities involved in our 
model. We formulate in Section 3 the optimal market making control problem and derive the 
associated Hamilton-Jacobi-Bellman quasi variational inequality (HJBQVI) from dynamic 
programming principle. We show how one can reduce the number of state variables to 
the inventory and spread for the resolution to this QVI in two standard cases. In the last 
section 4, we provide some numerical results and empirical performance analysis. 

2 A market-making model 

2.1 Mid price and spread process 

Let us fix a probability space (fi, T , P) equipped with a filtration ¥ = (J r t)t>o satisfying 
the usual conditions. It is assumed that all random variables and stochastic processes are 
defined on the stochastic basis (fl, T, ¥, P). 

The mid-price of the stock is an exogenous Markov process P, with infinitesimal ge- 
nerator V and state space P. For example, P is a Levy process (e.g. an arithmetic Brownian 
motion), or an exponential of Levy process (e.g. geometric Brownian motion). In the limit 
order book (LOB) for this stock, we consider a stochastic bid-ask spread resulting from the 
behaviour of market participants, taking discrete values, which are finite multiple of the 
tick size 5 > 0, and jumping at random times. This is modelled as follows: we first consider 
the tick time clock associated to a Poisson process (Nt)t with deterministic intensity X(t), 
and representing the random times where the buy and sell orders of participants in the 
market affect the bid-ask spread. We next define a discrete-time stationary Markov chain 
(S n )neN> valued in the finite state space § = 5I m , I m := {1, . . . , m}, m G N \ {0}, with 
probability transition matrix (pij)i<ij<M, i-e. P[5 n +i = j5\S n = iS] = pij, s.t. pa = 0, 
independent of N, and representing the random spread in tick time. The spread process 
(St)t in calendar time is then defined as the time-change of S by N, i.e. 

St = S Nt , t>0. (2.1) 

Hence, (St)t is a continuous time (inhomogeneous) Markov chain with intensity matrix R(t) 
= ( r ij(t))i<i,j<m, where ^(t) = X(t)pij for i / j, and ru(t) = - Y^j^i r ij (*) ■ We assume 
that S and P are independent. The best-bid and best-ask prices are defined by: P t 6 = 
p. — St p a — p, _L St 

2.2 Trading strategies in the limit order book 

We consider an agent (market maker), who trades the stock using either limit orders or 
market orders. She may submit limit buy (resp. sell) orders specifying the quantity and 
the price she is willing to pay (resp. receive) per share, but will be executed only when 
an incoming sell (resp. buy) market order is matching her limit order. Otherwise, she can 
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post market buy (resp. sell) orders for an immediate execution, but, in this case obtain 
the opposite best quote, i.e. trades at the best-ask (resp. best bid) price, which is less 
favorable. 

Limit orders strategies. The agent may submit at any time limit buy/sell orders at the 
current best bid/ask prices (and then has to wait an incoming counterpart market order 
matching her limit), but also control her own bid and ask price quotes by placing buy (resp. 
sell) orders at a marginal higher (resp. lower) price than the current best bid (resp. ask), 
i.e. at := P b + S (resp. P t °" := P t ° — 5). Such an alternative choice is used in practice 
by a market maker to capture market orders flow of undecided traders at the best quotes, 
hence to get priority in the order execution w.r.t. limit order at current best/ask quotes, 
and can be taken into account in our modelling with discrete spread of tick size S. 

There is then a tradeoff between a larger performance for a quote at the current best 
bid (resp. ask) price, and a smaller performance for a quote at a higher bid price, but 
with faster execution. The submission and cancellation of limit orders are for free, as they 
provide liquidity to the market, and are thus stimulated. Actually, market makers receive 
some fixed rebate once their limit orders are executed. The agent is assumed to be small 
in the sense that she does not influence the bid-ask spread. The limit order strategies are 
then modelled by a continuous time predictable control process: 

«r fce = (Q b t ,Q?,L b t ,L a ), t>0, 

where L = (L b ,L a ) valued in [0,£] 2 , i > 0, represents the size of the limit buy/sell order, 
and Q = (Q b ,Q a ) represent the possible choices of the bid/ask quotes either at best or 
at marginally improved prices, and valued in Q = Q b x Q a , with Q b = {Bb,Bb + }, Q a = 
{Ba,Ba_}: 

• Bb: best-bid quote, and Bb+: bid quote at best price plus the tick 

• Ba: best-ask quote, and £?a_: ask quote at best price minus the tick 

Notice that when the spread is equal to one tick <5, a bid quote at best price plus the tick is 
actually equal to the best ask, and will then be considered as a buy market order. Similarly, 
an ask quote at best price minus the tick becomes a best bid, and is then viewed as a sell 
market order. In other words, the limit orders Qt = {Q b ,Qt) take values in Q(S t -), where 
Q(s) = Q b x Q a when s > 5, Q(s) = {Bb} x {Ba} when s = 5. We shall denote by Q b = 
Q b for i > 1, and Q b = {Bb} for i = 1, and similarly for Qf for i £ I rn . 

We denote at any time t by TT b (Q b , P t , St) and ir a (Qf, P t , St) the bid and ask prices of 
the market maker, where the functions ir b (resp. 7r a ) are defined on Q b x P x S (resp. 
Q"xPx§) by: 

7T b ( q b ,p, S ) 

7r a (q a ,p,s) 

We shall denote by ft b {q b ,p) = ir b (q b ,p, s), irf(q a ,p) = TT a (q a ,p,s) for s = i5, i G 1 



p- 


s 

2' 


for 


q" 


= Bb 


p-l 


+ 6 


for 


q b 


= Bb + . 


p + 


s 

2' 


for 


q a 


= Ba 


p+l 


-5 


for 


q a 


= Ba_ 
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Remark 2.1 One can take into account proportional rebates received by the market ma- 
kers, by considering; ir b (q b ,p,s) = (p - f + 51 qb=Bb+ )(l - p), ir a (q a ,p,s) = (p + § - 
<51qa=B a _)(l + p), for some p G [0,1), or per share rebates with: 7r b (q b ,p,s) = p — | + 
(51 gb=Bfc+ - p, vr a (g a ,p,s) = p+ | - <51 g a =Ba _ +p, for some p > 0. 

The limit orders of the agent are executed when they meet incoming counterpart market 
orders. Let us then consider the arrivals of market buy and market sell orders, which 
completely match the limit sell and limit buy orders of the small agent, and modelled by 
independent Cox processes N a and N b . The intensity rate of Nf is given by X a (Q1,St) 
where A a is a deterministic function of the limit quote sell order, and of the spread, satisfying 
X a (Ba,s) < \ a (Bci-, s). This natural condition conveys the price/priority in the order 
execution in the sense that an agent quoting a limit sell order at ask price P a ~ will be 
executed before traders at the higher ask price P a , and hence receive more often market buy 
orders. Typically, one would also expect that A a is nonincreasing w.r.t. the spread, which 
means that the larger is the spread, the less often the market buy orders arrive. Likewise, 
we assume that the intensity rate of N b is given by X b (Q b ,St) where X b is a deterministic 
function of the spread, and X b (Bb, s) < X b (Bb + , s). We shall denote by X^(q a ) = X a (q a , s), 
X b (q b ) = X b (q b ,s) for s = iS, i € l m . 

For a limit order strategy a make = (Q b , Q a , L b , L a ), the cash holdings X and the number 
of shares Y hold by the agent (also called inventory) follow the dynamics 

dY t = L b dN b -L a t dN?, (2.2) 
dX t = -7r b (Q b ,P t -,S t -)L b dN b + Tr a (Q?,P t -,S t -)L<ldN?. (2.3) 

Market order strategies. In addition to market making strategies, the investor may 
place market orders for an immediate execution reducing her inventory. The submissions 
of market orders, in contrast to limit orders, take liquidity in the market, and are usually 
subject to fees. We model market order strategies by an impulse control: 

a — \T n ,(,n)n>0, 

where (r n ) is an increasing sequence of stopping times representing the market order decision 
times of the investor, and ( n , n > 1, are T Tn -measurable random variables valued in [— e, e], 
e > 0, and giving the number of stocks purchased at the best-ask price if ( n > 0, or selled 
at the best-bid price if ( n < at these times. Again, we assumed that the agent is small 
so that her total market order will be executed immediately at the best bid or best ask 
price. In other words, we only consider a linear market impact, which does not depend on 
the order size. When posting a market order strategy, the cash holdings and the inventory 
jump at times r„ by: 

Yr n = Y r -+Cn, (2.4) 

X Tn = X T --c(( n ,P Tn ,S Tn ) (2.5) 

where 

c(e,p,s) = ep+\e\- + e 
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represents the (algebraic) cost function indicating the amount to be paid immediately when 
passing a market order of size e, given the mid price p, a spread s, and a fixed fee e > 0. 
We shall denote by Ci(e,p) = c(e,p,s) for s = iS, i £ I m . 

Remark 2.2 One can also include proportional fees p G [0, 1) paid at each market order 
trading by considering a cost function in the form: c(e,p, s) = (e + e|e|)p + (|e| + pe)| + e, 
or fixed fees per share with c(e,p, s) = ep + |e| (| + p) + e. 

In the sequel, we shall denote by A the set of all limit/market order trading strategies 
a = (a make ,a take ). 

2.3 Parameters estimation 

In most order-driven markets, available data are made up of Level 1 data that contain 
transaction prices and quantities at best quotes, and of Level 2 data containing the volume 
updates for the liquidity offered at the L first order book slices (L usually ranges from 5 
to 10). In this section, we propose some direct methods for estimating the intensity of the 
spread Markov chain, and of the execution point processes, based only on the observation of 
Level 1 data. This has the advantage of low computational cost, since we do not have to deal 
with the whole volume of Level 2 data. Yet, we mention some recent work on parameters 
estimation from the whole order book data [I], but involving heavier computations based 
on integral transforms. 

Estimation of spread parameters. Assuming that the spread S is observable, let us 
define the jump times of the spread process: 

Bo = 0, 9 n+1 = m£{t >9 n : S t ^ S t -} , Vn > 1. 

Prom these observable quantities, one can reconstruct the processes: 

N t = # {9j > : 9j < t} , t > 0, 
S n = S 6n , n> 0. 

Then, our goal is to estimate the deterministic intensity of the Poisson process (Nt)t, and 
the transition matrix of the Markov chain (S n ) n from a path realization with high frequency 
data of the tick-time clock and spread in tick time over a finite trading time horizon T, 
typically of one day. From the observations of K samples of S n , n = 1, . . . , K , and since the 
Markov chain (S n ) is stationary, we have a consistent estimator (when K goes to infinity) for 
the transition probability pij := P[5 n+ i = jS\S n = iS] = P[(S n+ ±,S n ) = (jS,iS)]/P[S n = 
iS] given by: 

K 

^2 1 {(S n+1 ,S n )=(jS,iS)} 

Pij = ^Hr- - < 2 - 6 ) 

X] l {S n =i8} 
n=l 
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For the estimation of the deterministic intensity function X(t) of the (non) homogeneous 
Poisson process (Nt), we shall assume in a first approximation a simple natural parametric 
form. For example, we may assume that A is constant over a trading day, and more 
realistically for taking into account intra-day seasonality effects, we consider that the tick 
time clock intensity jumps e.g. every hour of a trading day. We then assume that A is in 
the form: 



A(t) 



E^ 1 



{t k <t<t k+1 } 



where (tfc)fc i s a fi xe d and known increasing finite sequence of with to = 0, and (Xk)k is 
an unknown finite sequence of (0, oo). In other words, the intensity is constant equal to A^ 
over each period [tk, tk+i], and by assuming that the interval length t^+i — tj. is large w.r.t. 
the intensity A& (which is the case for high frequency data), we have a consistent estimator 
of Afc, for all k, and then of X(t) given by: 



A* 



tk+i 



tk+i — tk 



(2.7) 



We performed these two estimation procedures (2.6) and (2.7) on SOGN.PA stock on 
April 18, 2011 between 9:30 and 16:30 in Paris local time. We used tick-by-tick level 1 data 
provided by Quanthouse, and fed via a OneTick Timeseries database. 

In table [TJ we display the estimated transition matrix: first row and column indicate 
the spread value s = id and the the cell ij shows pij. For this stock and at this date, the 
tick size was 5 = 0.005 euros, and we restricted our analysis to the first 6 values of the 
spread {m = 6) due to the small number of data outside this range. 



spread 


0.005 


0.01 


0.015 


0.02 


0.025 


0.03 


0.005 





0.410 


0.220 


0.160 


0.142 


0.065 


0.01 


0.201 





0.435 


0.192 


0.103 


0.067 


0.015 


0.113 


0.221 





0.4582 


0.147 


0.059 


0.02 


0.070 


0.085 


0.275 





0.465 


0.102 


0.025 


0.068 


0.049 


0.073 


0.363 





0.446 


0.03 


0.077 


0.057 


0.059 


0.112 


0.692 






Table 1: Estimation of the transition matrix (pij) for the underlying spread of the stock 
SOGN.PA on April 18, 2011. 

In table [2j we display the estimate for the tick time clock intensity X(t) by assuming 
that it is piecewise constant and jumps only at 10:30, 11:30, 16:30 in Paris local time. 
In figure [TJ we plot the tick time clock intensity by using an affine interpolation, and 
observed a typical U-pattern. Hence, a further step for the estimation of the intensity 
could be to specify a parametric form for the intensity function fitting U pattern, e.g. 
parabolic functions in time, and then use a maximum likelihood method for estimating the 
parameters. 

Estimation of execution parameters. When performing a limit order strategy a make , 
we suppose that the market maker permanently monitors her execution point processes 
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Time 


10:30 


11:30 


12:30 


13:30 


14:30 


15:30 


16:30 


Clock Intensity X(t) 


1.654 


0.799 


0.516 


0.377 


0.632 


1.305 


2.113 



Table 2: Estimation of the tick time clock (hourly basis) for the stock SOGN.PA on April 
18, 2011. Tick time clock intensity X(t) is expressed in second -1 . 



10:30:00 AM 



11:30:00 AM 



12:30:00 PM 



Figure 1: Plot of tick time clock intensity estimate for the stock SOGN.PA on April 18, 
2011 expressed in second -1 (affine interpolation). 



N a and N b , representing respectively the number of arrivals of market buy and sell orders 
matching the limit orders for quote ask Q a and quote bid Q b . We also assume that there 
is no latency so that the observation of the execution processes is not noisy. Therefore, 
observable variables include the quintuplet: 



(N t a ,N b ,Q?,Q b ,S t )e 



x Q a x Q b x § , t £ [0, T] 



Moreover, since N a and N b are assumed to be independent, and both sides of the order 
book can be estimated using the same procedure, we shall focus on the estimation for the 
intensity function X b (q b , s), q b G Q b = {Bb, Bb + }, s£§ = 5I m , of the Cox process N b . 

The estimation procedure for X b (q b , s) basically matchs the intuition that one must count 
the number of executions at bid when the system was in the state (g b , s) and normalize this 
quantity by the time spent in the state (q b , s). This is justified mathematically as follows. 
For any (q b ,s = iS) 6 Q b x S, let us define the point process 



b,q",i 



1 



{Q b u =q,S u -=i6} dN ui t > 0, 
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which counts the number of jumps of N b when (Q b , S) was in state (q b , s = iS). Then, for 
any nonnegative predictable process (H t ), we have 



E 



H t dK 



b,q b ,i 



E 
E 
E 



H t l {Q b =q b ,S t -=i5} dN t 
l t l {Q b t =q»,S t -=i5} 



ftlfnb — be. —;x}\ b (Qt, St)dt 



H tl{Qb =q b jSt _ =iS }>H(q b )dt 



(2.8 



where we used in the second equality the fact that X b (Q b ,St) is the intensity of N b . The 
relation (2.8) means that the point process N b,qh,% admits for intensity X b {q b )l^qb =q b s t ^=iS}- 



By defining 



Jo 



1 {Q b u =q,S u -=i5}du 



as the time that (Q b ,S) spent in the state (q b ,s = i5), this means equivalently that the 
process M b,q ' l = N b ' q 'I . , where A b ' q ' % = inf{u > : Tu Q ' l > t} is the cad-lag inverse of 

<yb,q b ,i^ - ls a p i sson process with intensity \i(q b )- By assuming that T^' q '* is large w.r.t. 
\i(q b ), which is the case when (S n ) is irreducible (hence recurrent), and for high-frequency 
data over [0,T], we have a consistent estimator of X b (q b ) given by: 



' T 



(2.9) 



Similarly, we have a consistent estimator for Xf(q a ) given by: 



7-a,q a ,i ' 
T 



(2.10) 



where N^' q ' l counts the number of executions at ask quote q a and for a spread iS, and 
7~t' 9 '* is the time that (Q a , S) spent in the state (q a , s = iS) over [0, T]. 

Let us now illustrate this estimation procedure on real data, with the same market data 
as above, i.e. tick-by-tick level 1 for SOGN.PA on April 18, 2011, provided by Quanthouse 
via OneTick timeseries database. Actually, since we did not perform the strategy on this 
real-world order book, we could not observe the real execution processes N b and N a . We 
built thus simple proxies N b,qb ' 1 and N a >i a >\ for q b = Bb, Bb+, q a = Ba, Ba,-, i = 1, . . . ,m, 
based on the following rules. Let us also assume that in addition to (Sg n ) n , we observe at 
jump times 9 n of the spread, the volumes (Vq ,Vg ) offered at the best ask and best bid 
price in the LOB together with the cumulated market order quantities and fig^ 1 " 

arriving between two consecutive jump times 9 n and # n+ i of the spread, respectively at 
best ask price and best bid price. We finally fix an arbitrarily typical volume Vq, e.g. Vq = 
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100 of our limit orders, and define the proxys 

fib,q ,i anc J ftarf>,i &t timeg ^ by . 

f-jb,Bb + ,i _ jr T b,Bb + ,i -. jT T b,Bb + ,i _ n 

N 0n + 1 ~ N e n + 1 ^ Vo< ^ELL^ n=lS ^ ,N Q - 

iTjb,Bb,i -jTjb,Bb+,i TTjb,Bb,i 

iy o n+1 ~ iy e n +^ Vo+vL< , & sELL jSen=lS y^o - u 

^n+l = N 0n + 1 { Vo<{) BUY tSgn=iS yN =0 

together with a proxy for the time spent in spread iS: 

V n +i = V n + 9 n)l{S 8n =iS}, V = 0. 

The interpretation of these proxies is the following: we consider the case where the (small) 
market maker instantaneously updates her quote Q b (resp. Q a ) and volume L b < Vq (resp. 
L a < Vq) only when the spread changes exogenously, i.e. at dates (0 n ), so that the spread 
remains constant between her updates, not considering her own quotes. If she chooses to 
improve best price i.e Q b dn = Bb+ (resp. Q% — BaJ) she will be in top priority in the 
LOB and therefore captures all incoming market order flow to sell (resp. buy). Therefore, 
an unfavourable way for (under)-estimating her number of executions is to increment N b 
(resp. N a ) only when total traded volume at bid ( res P- total volume traded at ask 

was g rea ter than Vq. If the market maker chooses to add liquidity to the best prices 
i.e. Q b 9n = Bb (resp. Qg = Ba), she will be ranked behind Vq (resp. V e a ) in LOB priority 
queue. Therefore, we increment N b (resp. N a ) only when the total traded volume at bid 
^dn L \ L ( res P- total volume traded at ask was greater than Vq + Vq (resp. Vq + V e a n ). 

We then provide a proxy estimate for X b (q b ), Xf{q a ) by: 

jyb,q b ,i jya,q a ,i 

^ b M b ) = K(v a ) = -%r . (2.11) 



We performed the estimation procedure (2.11). In table [3] we computed Xf(q a ) and 



X b (q ) for i = 1, ... ,6, and limit order quotes q = Bb+,Bb, q a = Ba,Ba-. Due to the 
lack of data, estimate for large values of the spread are less robust. In this table, each row 
corresponds to a choice of the spread and each column to a choice of the quotes. 

In figure [2] we plotted this estimated intensity as a function of the spread, i.e. s = iS — >■ 
X b (q b ), Xi(q a ) for q b G Q b , and q a £ Q a . As one would expect, (Af (.), A* (.)) are decreasing 
functions of i for the small values of i which matches the intuition that the higher are the 
(indirect) costs, the smaller is market order flow. Surprisingly, for large values of i this 
function becomes increasing, which can be due either to an estimation error, caused by the 
lack of data for this spread range, or a "gaming" effect, in other word liquidity providers 
increasing their spread when large or autocorrelated market orders come in. 
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Ba 


Ba- 


Bb 


Bb+ 


0.005 


0.0539 


0.1485 


0.0718 


0.1763 


0.010 


0.0465 


0.0979 


0.0520 


0.1144 


0.015 


0.0401 


0.0846 


0.0419 


0.0915 


0.020 


0.0360 


0.0856 


0.0409 


0.0896 


0.025 


0.0435 


0.1009 


0.0452 


0.0930 


0.030 


0.0554 


0.1202 


0.0614 


0.1255 



Table 3: Estimate of execution intensities A?(g a ) and X^(q b ), expressed in s _1 for SOGN.PA 
on April 18, 2011, between 9:30 and 16:30 Paris local time. Each row corresponds to a 
spread value as a multiple of 5 = 0.005. Each column corresponds to a limit order quote. 

0.2 t— 



0.18 




0.04 



0.02 - 

o 4 — ,— — ^- — ,— — ,— — , 

0.005 0.01 0.015 0.02 0.025 0.03 

Figure 2: Plot of execution intensities estimate as a function of the spread for the stock 
SOGN.PA on the 18/04/2011, expressed in s" 1 (affine interpolation). 

3 Optimal limit /market order strategies 
3.1 Control problem formulation 

Our market model in the previous section is completely determined by the state variables 
(X, Y, P, S) controlled by the limit/marker order trading strategies a € A. 

The objective of the market maker is the following. She wants to maximize over a 
finite horizon T the profit from her transactions in the LOB, while keeping under control 
her inventory (usually starting from zero), and getting rid of her inventory at the terminal 
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date: 



maximize 



E[U(X 2 



1 



g(Y t )dt] 



(3.1) 



over all limit/market order trading strategies a = (a ma ,a ) in A such that Yt = 0. 
Here U is an increasing reward function, 7 is a nonnegative constant, and g is a nonnegative 
convex function, so that the last integral term g(Yt)dt penalizes the variations of the 
inventory. Typical frameworks include the two following cases: 

• Mean-quadratic criterion: U(x) = x, 7 > 0, g(y) = y 2 . 

• Exponential utility maximization: U(x) = — exp(— rjx), 7 = 0. 

Let us first remove mathematically the terminal constraint on the inventory: Yt = 0, 
by introducing the liquidation function L(x, y,p, s) defined on M 2 x P x § by: 



L(x,y,p,s) 



c(-y,p,s) = x + yp 



\y\ 



£. 



This represents the value that an investor would obtained by liquidating immediately by a 
market order her inventory position y in stock, given a cash holdings x, a mid-price p and 



a spread s. Then, problem (3.1) is formulated equivalently as 



maximize B[U(L{X T , Y t , Pt, St)) - 7 f g{Y t )dt] 

Jo 



(3.2) 



over all limit/market order trading strategies a = (a m , a ) in A. Indeed, the maximal 



value of problem ( |3.1| ) is clearly smaller than the one of problem (3.2) since for any a £ A 
s.t. Yt = 0, we have L(Xt,Yt, Pt, St) = Xt- Conversely, given an arbitrary a £ A, let 
us consider the control a G A, coinciding with a up to time T, and to which one add at 
the terminal date T the market order consisting in liquidating all the inventory Yt- The 
associated state process (X,Y,P,S) satisfies: X t = X t , Y t = Y t for t < T, and Xt = 
L(Xt,Yt, Pt, St), Yr = 0. This shows that the maximal value of problem (3.2) is smaller 



and then equal to the maximal value of problem (3.1 ) 



We then define the value function for problem (3.2) (or (|3.1 



v(t,z,s) 



sup E t)ZjS 



U{L{Z T ,S T ))-1 / g(Y u )du 



T 



(3.3) 



for t E [0, T], z = (x,y,p) G M 2 x P, s G S. Here, given a £ A, E tjZjS denotes the ex- 
pectation operator under which the process (Z,S) = (X,Y, P, S) solution to (2.1)-(2.2)- 
(2.3)-(2.4)-(2.5), with initial state (Z t -,S t -) = (z,s), is taken. Problem (3.3) is a mixed 
regular/impulse control problem in a regime switching jump-diffusion model, that we shall 
study by dynamic programming methods. Since the spread takes finite values in S = 5I m , 
it will be convenient to denote for i G I m , by Vi(t,z) = v(t,z,i5). By misuse of notation, 
we shall often identify the value function with the M m -valued function v = ("Ui)ieim defined 
on [0, T] x M 2 x P. 
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3.2 Dynamic programming equation 

For any q = (q b ,q a ) G Q, £ = (£ b ,£ a ) G [0,-£] 2 , we consider the second-order nonlocal 
operator: 

C q ' e (p(t,x,y,p,s) = Vip(t,x,y,p,s) + R(t)ip(t,x,y,p,s) 

+ \\q\ s) [<p(t, T b {x, y,p, s, q b , £ b ),p, s) - <p(t, x, y, p, s)] 

+ X a (q a , s) [<p(t, T a (x, y,p, s, q a , i a ),p, s) - tp(t, x, y,p, s)] , (3.4) 

for (t,x,y,p,s) G [0, T] x I 2 x P x S, where 

m 

R(t)ip(t,x,y,p,s) = ^2rij(t)[if(t,x,y,p,j6) - ip(t,x,y,p,iS)], for s = id, i G I m , 

i=i 

and T b (resp. T a ) is defined from R 2 x P x S x Q b x R+ (resp. R 2 x P x S x Q a x R + into 
R 2 ) by 

T b (x,y,p,s,q b ,e b ) = (x - 7T b (q b ,p, s)£ b , y + i b ) 
T a (x,y,p, S ,q a ,£ a ) = [x + ir a (q a ,p, s)£ a , y - £ a ). 



The first term of £ 9 ' in (3.4) correspond to the infinitesimal generator of the diffusion 
mid-price process P, the second one is the generator of the continuous-time spread Markov 
chain S, and the two last terms correspond to the nonlocal operator induced by the jumps 
of the cash process X and inventory process Y when applying an instantaneous limit order 
control (Q t ,L t ) = (q,£). 

Let us also consider the impulse operator associated to market order control, and defined 

by 

M(f(t,x,y,p,s) = sup ip{t,T take (x,y,p, s,e),p,s), 

ee[— e,e] 

where T take is the impulse transaction function defined from R 2 x P x S x R into R 2 by: 

T take (x,y,p,s,e) = (x - c(e,p,s),y + e 



The dynamic programming equation (DPE) associated to the control problem (3.3) is 
the quasi- variational inequality (QVI): 

min [ — — sup C q/ v + 79 , v - Mv] = 0, on [0, T) x R 2 x P x S(3.5) 

dt (?,f)eS(s)x[o,f"p 

together with the terminal condition: 

v(T,x,y,p,s) = U(L(x,y,p,s)), V(x, G R 2 x P x S. (3.6) 
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mm 



This is also written explicitly in terms of system of QVIs for the functions Vi, i £ I m : 

- Vv,i - ^2,rij{t)[vj{t,x,y,p) - Vi(t,x,y,p)] 

sup X b (q b )[v i (t,x-7r b l (q b ,p)£ b ,y + £ b ,p)-v i (t,x,y,p)} 

sup \?(q a )[v i (t,x + ir?(q a ,p)£ a ,y-e,p)-v i (t,x,y,p)} +jg(y) ; 
(9°/ a )eQ?x[o,f] 

Vi(t,x,y,p)- sup Vi(t, x - a(e,p),y + e,p) = 0, 

ee[— e,e] 

for (t,x,y,p) G [0,T) x I 2 x P, together with the terminal condition: 

Vi(T,x,y,p) = U(Li(x,y,p)), V(x, 6 M 2 x P, 

where we set L{(x,y,p) = L(x,y,p,i5). 

By methods of dynamic programming, one can show by standard arguments that the 



value function v is the unique viscosity solution to the QVI (3.5)- (3.6) under suitable growth 
conditions depending on the utility function U and penalty function g. We next focus on 
some particular cases of interest for reducing remarkably the number of states variables in 
the dynamic programming equation DPE. 

3.3 Two special cases 

1. Mean criterion with penalty on inventory 

We first consider the case as in |12j where: 

U(x) = x, and (Pt)t is a martingale. (3-7) 

The martingale assumption of the stock price under the historical measure under which the 
market maker performs her criterion, reflects the idea that she has no information on the 
future direction of the stock price. Moreover, by starting typically from zero endowment 
in stock, and by introducing a penalty function on inventory, the market maker wants to 
keep an inventory that fluctuates around zero. 

In this case, and exploiting similarly as in [2] the martingale property that Vp = 0, 



we see that the solution v = (vj)iei m to the dynamic programming system (3.5)-(3.6) is 
reduced into the form: 

Vi(t,x,y,p) = x + yp + (pi(t,y), (3.8) 
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where 



i e i m is solution the system of integro-differential equations (IDE): 



mm 



dt 



^2rij(t)[(f>j(t,y) - <fti{t,y)\ 



sup K(q b )Mt,y + n-Mt,y) 

(q b ^ b )eQ b i x[o,e\ 

AS 



AS 



51 



q b =Bb^ 



sup \?( q ")[ ( f )i (t,y-t a )-<l> i (t,y) + (---51 qa=Ba _)l a } + ig {y) 



cf>i(t,y)- sup [(/>i(t,y + e) 

ee[—e,e] 



iS, 



0, 



together with the terminal condition: 

<fti(T,y) 



AS 
^2 _£ ' 



These one-dimensional IDEs can be solved numerically by a standard finite-difference 
scheme by discretizing the time derivative of (ft, and the grid space in y. The reduced 
form (3.8) shows that the optimal market making strategies are price independent, and 
depend only on the level of inventory and of the spread, which is consistent with stylized 
features in the market. Actually, the IDEs for (0j) even show that optimal policies do not 
depend on the martingale modeling of the stock price. 

2. Exponential utility criterion 

We next consider as in [1J a risk averse market marker: 

U(x) = -exp(-r ? x), x 6 M, t] > 0, 7 = 0, (3.9) 
and assume that P follows a Bachelier model: 

dP t = bdt + adW t . 

Such price process may take negative values in theory, but at the short-time horizon where 
high-frequency trading take place, the evolution of an arithmetic Brownian motion looks 
very similar to a geometric Brownian motion as in the Black-Scholes model. 

In this case, we see, similarly as in [8], that the solution v = (wj)iei m to the dynamic 



(3.10) 



programming system (3.5 )-( |3.6| is reduced into the form: 

Vi{t,x,y,p) = U{x + yp)tpi{t,y), 



where ip = (y>i)iei m is solution the system of one-dimensional integro-differential equations 
(IDE): 



max 



dip- 1 m 



dt 



AS 



inf _ A^)[exp ( - V (- - 61 qb=Bb+ )l b )<p t (t, y + f) - Vi {t, y)] 

inf _ \?(q a )[exp {-vCi- ^=Ba^)i a )p l {t,y - t) - ^(t,y)] 
(q a ,£ a )&Qfx[o,e] 2 

i5 

tfi(t,y)- inf [exp(r]\e\— +rjs)(pi(t,y + e) 

ee -e,e 2 



0, 
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together with the terminal condition: 

<Pi{T,y) 



exp(r?|y|^-). 



Actually, we notice that the reduced form (3.10) holds more generally when P is a Levy 
process, by using the property that in the case: VU{x + yp) = tp(y)U(x + yp) for some 
function ip depending on the characteristics of the generator V of the Levy process. As 



in the above mean- variance criterion case, the reduced form (3.10) shows that the optimal 
market making strategies are price independent, and depend only on the level of inventory 
and of the spread. However, it depends on the model (typically the volatility) for the stock 
price. 



4 Computational results 

In this section, we provide numerical results in the case of a mean criterion with penalty 
on inventory, that we will denote within this section by a*. We used parameters shown 
in table Q together with transition probabilities (Pij)i<i,j<M calibrated in table and 
execution intensities calibrated in table ([3]), slightly modified to make the bid and ask sides 
symmetric. 



Parameter 



Signification 



8 Tick size 

p Per share rebate 

e Per share fee 

eo Fixed fee 

X(t) Tick time intensity 

(a) Market parameters 



Value 



0.005 
0.0008 
0.0012 
10" 6 

EE IS" 1 



Parameter 


Signification 


Value 


U(x) 


Utility function 


X 


7 


Inventory penalization 


5 


t 


Max. volume make 


100 


e 


Max. volume take 


100 



(b) Optimization parameters 



Parameter 


Signification 


Value 


T 


Length in seconds 


300 s 


ymin 


Lower bound shares 


-1000 


Umax 


Upper bound shares 


1000 


n 


Number of time steps 


100 


771 


Number of spreads 


6 



(c) Discretization/localization parameters 



Parameter 


Signification 


Value 


N MC 


Number of paths for MC simul. 


10 5 


At 


Euler scheme time step 


0.3 s 


to 


B/A qty for bench, strat. 


100 


x 


Initial cash 





yo 


Initial shares 





Po 


Initial price 


45 



(d) Backtest parameters 



Table 4: Parameters 



Shape of the optimal policy. The reduced form (3.8) shows that the optimal policy 
a* does only depend on time t, inventory y and spread level s. One can represent a* as 
a mapping a* : M + x E x S — > A with a* = (a*' make , a * ,take ) thus it divides the space 
1+ x K x § in two zones M and T so that at M = ( a *' mafce , 0) and at- = (0,a*' take ). 
Therefore we plot the optimal policy in one plane, distinguishing the two zones by a color 
scale. For the zone Ai, due to the complex nature of the control, which is made of four 
scalars, we only represent the prices regimes. 
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Ba-,Bb+ 






Ba,Bb+ 




Ba-,Bb ^yS* 
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MARKET 




Ba.Bb 


SELL AT 
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(num. of si 
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Bb+/ 
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/ 




\ 

Ba,Bb+ 


/ / 

Ba-,Bb 

/ 




BUY AT 


V 


\ / 


SELL AT 


MARKET 




\ 1 


MARKET 






\ 

Bb 





(a) near date (b) near date T 

Figure 3: Stylized shape of the optimal policy sliced in YS. 



Moreover, when using constant tick time intensity X(t) = A and in the case where T S> j 
we can observe on numerical results that the optimal policy is mainly time invariant near 
date 0; on the contrary, close to the terminal date T the optimal policy has a transitory 
regime, in the sense that it critically depends on the time variable t. This matches the 
intuition that to ensure the terminal constraint Y? = 0, the optimal policy tends to get rid 
of the inventory more aggressively when close to maturity. In figure [3j we plotted a stylized 
view of the optimal policy, in the plane (y, s), to illustrate this phenomenon. 

Benchmarked empirical performance analysis. We made a backtest of the optimal 
strategy a*, on simulated data, and benchmarked the results with the three following 
strategies: 

• Optimal strategy without market orders (WoMO), that we denote by ot w : this strategy 
is computed using the same IDEs, but in the case where the investor is not allowed to use 
market orders, which is equivalent to setting e = 0. 

• Constant strategy, that we denote by a c : this strategy is the symmetric best bid, best 
ask strategy with constant quantity £q on both sides, or more precisely a c := (a c,make , 0) 
with a c t ' make = (Bb, Ba,lo,&o)- 

• Random strategy, that we denote by a r : this strategy consists in choosing randomly 
the price of the limit orders and using constant quantities on both sides, or more precisely 



a 



Q 



r,make 



0) with a t 



r,make 



where (? fe ,? a ) is s.t. Vt e [0;T] , P(<^ 



i 

2' 



Bb) = Pfe 6 = Bb + ) = P( ? f = Ba) = P(?f = Ba- 

Our backtest procedure is described as follows. For each strategy a G {a*, a w , a c , a r }, 
we simulated N MC paths of the tuple (X a , Y a ,P, S, N a < a , N b ' a ) on [0, T], according to eq. 



(2.1)-(2.2)-(2.3)-(2.4)-(2.5), using a standard Euler scheme with time-step At. Therefore 
we can compute the empirical mean (resp. empirical standard deviation), that we denote 
by m(.) (resp. cr(.)), for several quantities shown in table 

Optimal strategy a* demonstrates significant improvement of the information ratio 
IR(Xt) ■= m(X'r) / <j(Xt) compared to the benchmark, which is confirmed by the plot of 
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optimal a* 


WoMO a w 


constant a c 


random a r 


Terminal wealth 


m{X T ) / (j(X t ) 


2.117 


1.999 


0.472 


0.376 




m(X T ) 


26.759 


25.19 


24.314 


24.022 




a(X T ) 


12.634 


12.599 


51.482 


63.849 


Num. of exec, at bid 


m{N%) 


18.770 


18.766 


13.758 


21.545 




a(N^) 


3.660 


3.581 


3.682 


4.591 


Num. of exec, at ask 


rn(N^) 


18.770 


18.769 


13.76 


21.543 






3.666 


3.573 


3.692 


4.602 


Num. of exec, at market 


m(N^ arket ) 


6.336 













a(N^ arket ) 


2.457 











Maximum Inventory 


M SU P se[ 0;T] \Y 3 \) 


241.019 


176.204 


607.913 


772.361 






53.452 


23.675 


272.631 


337.403 



Table 5: Performance analysis: synthesis of benchmarked backtest (10 5 simulations). 



the whole empirical distribution of Xt (see figure Q). 

Performance empirical distribution (spline interpolation) 



0.0300 
0.0275 
0.0250 
0.0225 
; 0.020D 
: D.D17B 
I D.G150 
' 0.0125 
0.0 100 
0.0075 
0.0050 
0.0025 
0.000'J 
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Figure 4: Empirical distribution of terminal wealth Xt (spline interpolation). 

Even if absolute values of m(Xy) are not representative of what would be the real-world 
performance of such strategies, these results prove that the different layers of optimization 
are relevant. Indeed, one can compute the ratios [m(Xf) - m(Xf)] /a(Xf) = 0.194 and 
[m(Xp ) — m(Xy )1 /a(Xj, ) = 0.124 that can be interpreted as the performance gain, 



measured in number of standard deviations, of the optimal strategy a* compared respec- 
tively to the constant strategy a c and the WoMO strategy a w . Another interesting statistics 
is the surplus profit per trade \m{X^*) — m(X£; c )] / m(N T ,a ) + m{N T ' a ) + m(N T n,arket ' a ) 
0.056 euros per trade, recalling that the maximum volume we trade is I = e = 100. Note 
that for this last statistics, the profitable effects of the per share rebates p are partially 
neutralized because the number of executions is comparable between a* and a c ; therefore 
the surplus profit per trade is mainly due to the revenue obtained from making the spread. 
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To give a comparison point, typical clearing fee per execution is 0.03 euros on multilateral 
trading facilities, therefore, in this backtest, the surplus profit per trade was roughly twice 
the clearing fees. 

We observe in the synthesis table that the number of executions at bid and ask are 
symmetric, which is also confirmed by the plots of their empirical distributions in figure 
(jHj). This is due to the symmetry in the execution intensities X b and A a , which is reflected 
by the symmetry around y = in the optimal policy. 



N bid empirical distribution (spline interpolation) N ask empirical distribution (spline interpolation) 




2.5 5.0 7.5 10.0 12.5 15.0 17.5 20.0 22.5 25.0 27.5 2.5 5.0 75 10.0 12 5 150 17.5 20.0 22.5 25.0 27.5 

Value Value 



\-m- Optimal -*-WoMQ Cst | Optimal -«-WoMO Gst| 

(a) N Bid empirical distribution (b) N Ask empirical distribution 

Figure 5: Empirical distribution of the number of executions on both sides. 

Moreover, notice that the maximum absolute inventory is efficiently kept close to zero 
in a* and a w , whereas in cf and a r it can reach much higher values. The maximum 
absolute inventory is higher in the case of a* than in the case a w due to the fact that 
a* can unwind any position immediately by using market orders, and therefore one may 
post higher volume for limit orders between two trading at market, profiting from reduced 
execution risk. 

Efficient frontier. An important feature of our algorithm is that the market maker can 
choose the inventory penalization parameter 7. To illustrate its influence, we varied the 
inventory penalization 7 from 50 to 6.10~ 2 , and then build the efficient frontier for both 
the optimal strategy a* and for the WoMO strategy a w . Numerical results are provided in 
table §6§ and a plot of this data is in figure Q. 

We display both the "gross" information ratio IR(X^ ) := m(X^ )/a(X^ ) and the 
"net" information ratio NIR(X^ ) := (m(Jy ) — m(X?f)) jo(Xj. ) to have more precise 
interpretation of the results. Indeed, m(X^) seems largely overestimated in this sim- 
ulated data backtest compared to what would be real- world performance, for all a £ 
{a* ,a w , a c ,a T }. Then, to ease interpretation, we assume that a c has zero mean per- 
formance in real-world conditions, and therefore offset the mean performance m(X^ ) by 
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7 


a(Xf) 


m{Xf) 


a{Xf) 




m(xf) 


NIR(X£*) 


50.000 


5.283 


12.448 


4.064 


9.165 


2.356 


-2.246 


25.000 


7.562 


18.421 


7.210 


16.466 


2.436 


-0.779 


12.500 


9.812 


22.984 


9.531 


20.971 


2.343 


-0.135 


6.250 


11.852 


25.932 


11.749 


24.232 


2.188 


0.136 


3.125 


14.546 


28.153 


14.485 


26.752 


1.935 


0.263 


1.563 


15.819 


28.901 


16.830 


28.234 


1.827 


0.289 


0.781 


19.088 


29.952 


19.593 


29.145 


1.569 


0.295 


0.391 


20.898 


30.372 


20.927 


29.728 


1.453 


0.289 


0.195 


23.342 


30.811 


23.247 


30.076 


1.320 


0.278 


0.098 


25.232 


30.901 


24.075 


30.236 


1.225 


0.261 


0.049 


26.495 


31.020 


24.668 


30.434 


1.171 


0.253 


0.024 


27.124 


30.901 


25.060 


30.393 


1.139 


0.242 


0.012 


27.697 


31.053 


25.246 


30.498 


1.121 


0.243 


0.006 


28.065 


30.998 


25.457 


30.434 


1.105 


0.238 



Table 6: Efficient frontier data 



the constant — m(X?f) when computing the NIR. This has simple visual interpretation as 
shown in figure Q. 
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Figure 6: Efficient frontier plot 

Observe that highest (net) information ratio is reached for 7 ~ 0.8 for this set of 
parameters. At this point 7 ~ 0.8, the annualized value of the NIR (obtained by simple 
extrapolation) is 47, but this simulated data backtest must be completed by a backtest 
on real data. Qualitatively speaking, the effect of increasing the inventory penalization 
parameter 7 is to increase the zone T where we trade at market. This induces smaller 
inventory risk, due to the fact that we unwind our position when reaching relatively small 
values for \y\. This feature can be used to enforce a soft maximum inventory constraint 
directly by choosing 7. 
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